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Background: 3CL?"° protease is required for coronaviral polyprotein processing and is only active as a dimer. 
Results: MERS-CoV 3CL?”° is a weakly associated dimer requiring ligand binding for dimer formation. 
Conclusion: Ligand-induced dimerization is a key mechanism for regulating the enzymatic activity of MERS-CoV 3CLP*° 


during polyprotein processing. 


Significance: Activation via ligand-induced dimerization may add complexity for the development of MERS-CoV 3CLP*° 


inhibitors as antivirals. 


All coronaviruses, including the recently emerged Middle 
East respiratory syndrome coronavirus (MERS-CoV) from the 
B-CoV subgroup, require the proteolytic activity of the nsp5 
protease (also known as 3C-like protease, 3CL?"°) during virus 
replication, making it a high value target for the development of 
anti-coronavirus therapeutics. Kinetic studies indicate that in 
contrast to 3CLP*° from other B-CoV 2c members, including 
HKU4 and HKU5, MERS-CoV 3CL?”® is less efficient at pro- 
cessing a peptide substrate due to MERS-CoV 3CL?” being a 
weakly associated dimer. Conversely, HKU4, HKU5, and SARS- 
CoV 3CLP*° enzymes are tightly associated dimers. Analytical 
ultracentrifugation studies support that MERS-CoV 3CL?”® is a 
weakly associated dimer (K, ~52 pM) with a slow off-rate. Pep- 
tidomimetic inhibitors of MERS-CoV 3CLP"° were synthesized 
and utilized in analytical ultracentrifugation experiments and 
demonstrate that MERS-CoV 3CL?*"° undergoes significant 
ligand-induced dimerization. Kinetic studies also revealed that 
designed reversible inhibitors act as activators at a low com- 
pound concentration as a result of induced dimerization. Pri- 
mary sequence comparisons and x-ray structural analyses of two 
MERS-CoV 3CLpro and inhibitor complexes, determined to 1.6 
A, reveal remarkable structural similarity of the dimer interface 
with 3CLP*° from HKU4-CoV and HKU5-CoV. Despite this 
structural similarity, substantial differences in the dimerization 
ability suggest that long range interactions by the nonconserved 
amino acids distant from the dimer interface may control 
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MERS-CoV 3CLP*° dimerization. Activation of MERS-CoV 
3CLP*° through ligand-induced dimerization appears to be 
unique within the genogroup 2c and may potentially increase 
the complexity in the development of MERS-CoV 3CLP”® inhib- 
itors as antiviral agents. 


Coronaviruses (CoVs)? are enveloped, positive-strand RNA 
viruses that infect a variety of vertebrates, including bats, live- 
stock, pets, poultry, and humans (1-3). Although human CoVs 
cause respiratory illnesses of mild to moderate severity (4-9), 
two recently emerged CoVs, severe acute respiratory syndrome 
coronavirus (SARS-CoV) and Middle East respiratory syn- 
drome coronavirus (MERS-CoV), have demonstrated their 
potential to become a serious threat to public health. MERS- 
CoV emerged late in 2012, and unlike its predecessor SARS- 
CoV, MERS-CoV continues to exhibit up to a 35% fatality rate 
(10-12). 

Based on the sequence analysis of seven genes of the replicase 
domain, MERS-CoV has been classified as a B-CoV genogroup 
2c member, along with closely related bat coronaviruses HKU5 
(Pipistrellus bat) and HKU4 (Tylonycteris bat) (13, 14). Increas- 
ing evidence suggests that bats may serve as zoonotic reservoirs 
for MERS-CoV (15, 16). Evidence presented by recent studies 
also supports the local zoonotic transmission of MERS-CoV 
from dromedary camels to humans (17, 18). Alarmingly, 
human-to-human transmission during close contact, especially 
in elderly or patients with underlying health conditions, has 
also been reported for MERS-CoV (19-22). In the wake of the 
recent upsurge in the laboratory-confirmed cases of MERS- 
CoV, including two recently identified cases in the United 


3 The abbreviations used are: CoV, coronavirus; MERS, Middle East respiratory 
syndrome; SARS, severe acute respiratory syndrome; nsp, nonstructural 
protein; 3CL°’°, 3-chymotrypsin-like protease; AUC, analytical ultracentri- 
fugation; SV, sedimentation velocity; BME, B-mercaptoethanol; BisTris, 
2-[bis(2-hydroxyethyl)amino]-2-(hydroxymethyl) propane-1,3-diol; 
PDB, Protein Data Bank. 


JOURNAL OF BIOLOGICAL CHEMISTRY 19403 


SIOZ ‘OT IsNSny UO YSingsyig Jo AjIsIOATUA ye /SIO‘Og! MMmM//:dyy Wo pepeopumog 


Ligand-induced Dimerization Regulates MERS-CoV 3CL?"° 


States (23), there is an urgent need to study and characterize the 
properties of important drug targets of MERS-CoV for the 
development of effective therapeutics. 

Coronaviruses express a >800-kDa replicase polyprotein, 
which is processed by viral 3CL*° protease (or nsp5) at 11 dis- 
tinct cleavage sites to yield intermediate and mature nonstruc- 
tural proteins (nsp) responsible for many aspects of virus repli- 
cation (3, 24—26). Because of its indispensable role in the virus 
life cycle, 3CL*° is an important target for therapeutic inter- 
vention against coronavirus infections (27-33). 

Anumber of kinetic, biophysical, and x-ray structural studies 
have demonstrated that SARS-CoV 3CLP"® is only active in 
vitro as a tightly associated dimer with a dimer dissociation 
constant (K,) in the low nanomolar range (34-38). The addi- 
tion or deletion of amino acids, e.g. His, affinity tags, at either 
the N or C terminus drastically reduces the enzymatic rate and 
decreases the ability of SARS-CoV 3CL?° to dimerize (37). 
Although cellular evidence for the auto-cleavage mechanism 
(cis versus trans) of 3CLP"® is lacking, models for how 3CLP"° 
cleaves itself from the polyprotein to form the mature dimer 
have been proposed based on in vitro studies using purified 
3CLP’° (34, 39, 40). A current model posits that two inactive 
3CLP*° molecules within two separate polyproteins recognize 
each other and form an immature dimer capable of cleaving the 
nsp4 | nsp5 and nsp5 | nsp6 sites in trans, followed by forma- 
tion of an active and mature dimer that can then rapidly process 
other cleavage sites and multiple polyproteins. It has also been 
proposed that substrate-induced dimerization regulates the 
enzymatic activity of SARS-CoV 3CL?*° during virus replica- 
tion; however, no experimental evidence of this has ever been 
demonstrated in infected cells (40). Although our knowledge of 
SARS-CoV 3CL?"? is extensive, the dimerization properties of 
3CLP*° from MERS-CoV and other coronaviruses, as well as the 
factors regulating their enzymatic activity, remain largely 
unknown. 

To understand the properties of MERS-CoV 3CL?"®, we con- 
ducted a series of kinetic, biophysical and x-ray structural stud- 
ies. Here, we report a detailed kinetic and biophysical analysis of 
MERS-CoV 3CLP"° activity and dimerization. These kinetic 
and biophysical studies provide evidence for a weakly associ- 
ated MERS-CoV 3CL?”° dimer. In addition, we utilized our pre- 
vious knowledge on the design of potent SARS-CoV 3CLP"° 
peptidic inhibitors to design a series of inhibitors of MERS-CoV 
3CLP”® that exhibit low micromolar potency. We demonstrate 
that MERS-CoV 3CLP*® requires the binding of a ligand for 
dimer formation, indicating that ligand-induced dimerization 
is likely a key mechanism in the regulation of MERS-CoV 
3CLP"® activity during virus infection. 


Experimental Procedures 


Construct Design and Expression of MERS-CoV 3CL?’"°—The 
gene encoding 3CLP"° protease of MERS-CoV (amino acid res- 
idues 3248-3553 in the replicase polyprotein, GenBank’ 
accession number AHC74086.1) was codon-optimized for 
optimal expression in E. coli (BioBasic Inc). The gene was sub- 
cloned into pET-11a expression vector with an N-terminal His, 
tag followed by the nsp4 | nsp5 auto-cleavage site using the 
forward primer 5'-ATATACATATGCACCACCACCAC- 
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CACCACAGCGGTGTTCTGCAGTCTGGTC-3’ and the 
reverse primer 5'-GACGGATCCTTACTGCATCACAA- 
CACCCATGATCTGC-3’. The construct was verified by DNA 
sequencing at the Purdue University Genomics Core Facility. 
This construct results in the expression of MERS-CoV 3CLP*° 
without any N- or C-terminal extensions. MERS-CoV 3CL?r° 
was expressed through auto-induction in Escherichia coli 
BL21-DE3 cells in the presence of 100 ug/ml carbenicillin as 
described previously (41). Cells were harvested by centrifuga- 
tion at 5000 X g for 20 min at 4 °C, and the pellets were stored at 
—80 °C until further use. 

MERS-CoV 3CL””? Purification—Frozen pellets from 4 liters 
of bacterial cell culture were thawed on ice and resuspended in 
250 ml of Buffer A (20 mm Tris, pH 7.5, 0.05 mm EDTA, 10% 
glycerol, and 5 mm B-mercaptoethanol (BME)), containing 500 
pg of lysozyme and a small amount of DNase. Cells were then 
lysed using a single pass through a French press at 1200 p.s.i., 
and cell debris was removed from the cleared lysate by centri- 
fuging at 29,000 x g for 30 min. Solid ammonium sulfate was 
added to the cleared lysate to a final concentration of 1 M 
through gradual mixing on ice. 

Hydrophobic Interaction Chromatography—The cleared 
lysate, mixed with ammonium sulfate, was loaded at a flow rate 
of 3 ml/min onto a 60-ml phenyl-Sepharose 6 fast-flow high- 
sub column (Amersham Biosciences) equilibrated with Buffer 
B (50 mo Tris, pH 7.5, 1 Mammonium sulfate, 0.05 mm EDTA, 
10% glycerol, and 5 mm BME). The column was then washed 
with 5X column volume (300 ml) of Buffer B at a flow rate of 4 
ml/min. Protein was eluted using a 5X column volume (300 ml) 
linear gradient to 100% Buffer A. Fractions (12 ml) were col- 
lected, and those containing MERS-CoV 3CL?"°, as judged 
through SDS-PAGE analysis and specific activity measure- 
ments, were pooled (120 ml) and exchanged into 2 liters of 
Buffer A via overnight dialysis in a 10,000 molecular weight 
cutoff SnakeSkin® dialysis tubing (Thermo Scientific). 

DEAE Anion-exchange Chromatography—Dialyzed sample 
from the previous step was loaded at a flow rate of 3 ml/min 
onto a 120- ml DEAE anion-exchange column (Amersham Bio- 
sciences) equilibrated with Buffer A. The column was then 
washed with 2X column volume (240 ml) of Buffer A at a flow 
rate of 4 ml/min. A linear gradient (total volume 480 ml) to 40% 
Buffer C (50 mm Tris, pH 7.5, 1 M NaCl, 0.05 mm EDTA, 10% 
glycerol, and 5 mm BME) was used to elute the protein. Frac- 
tions (6 ml) were collected, and those containing MERS-CoV 
3CLP*° were pooled (66 ml) and dialyzed for 4 h in 4 liters of 
Buffer D (20 mm MES, pH 5.5, 0.05 mm EDTA, 10% glycerol, 
and 5 mM BME). 

Mono S Cation-exchange Chromatography—Following dial- 
ysis, the pH of the sample was manually adjusted to 5.5 using 1 
M solution of MES, pH 5.5, and any precipitated protein was 
removed by filtering through a 0.22-4m pore size Millex-GP 
filter (Millipore). The filtered sample was then loaded at a flow 
rate of 2 ml/min onto an 8-ml Mono S 10/100 column (Amer- 
sham Biosciences) equilibrated in Buffer D. The column was 
then washed with 5X column volume (40 ml) of Buffer D at a 
flow rate of 2 ml/min. Protein was eluted using a 25x column 
volume (200 ml) and a linear gradient to 50% Buffer E (50 mm 
MES, pH 5.5, 1 M NaCl, 0.05 mM EDTA, 10% glycerol, and 5 mm 
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BME). Fractions (2 ml) were collected, and those containing 
MERS-CoV 3CL?P"® were pooled (22 ml) and concentrated to 
~5 mg/ml. 

Gel Filtration Chromatography—As the final purification 
step, the concentrated protein sample was loaded onto the 
preparation grade Superdex 75 26/60 gel filtration column 
(Amersham Biosciences) equilibrated with Buffer F (25 mm 
HEPES, pH 7.5, 10% glycerol, 2.5 mm dithiothreitol (DTT)). 
Protein was eluted isocratically at a flow rate of 1 ml/min with 
Buffer F. Fractions (2 ml) containing MERS-CoV 3CL?"° were 
pooled (total volume of 34 ml) and concentrated to ~5 mg/ml. 
For final storage of the purified MERS-CoV 3CL?*° enzyme, 
300-1 protein aliquots were placed into 1-ml screw-cap vials, 
flash-frozen under liquid nitrogen, and then stored at —80 °C 
until further use. 

Purification of SARS-CoV, HKU4-CoV, and HKUS-CoV 
3CL?’°—SARS-CoV 3CLP*® and HKU5-CoV 3CL?*° with 
authentic N and C termini were expressed and purified as 
described previously (37, 42). HKU4-CoV 3CL?"® was purified 
utilizing a modified protocol from Ref. 42. Final protein yield 
was calculated based on the measurement of total activity units 
(uM product/min), specific activity (units/mg), and milligrams 
of protein obtained (Bio-Rad protein assay) after each chro- 
matographic step. 

Synthesis of Compounds 1-11—The peptidomimetic com- 
pounds with Michael acceptor groups (compounds 1-9, Table 
3) were synthesized via very similar methods to those published 
previously (30, 43). Synthesis of noncovalent peptidomimetic 
compounds 10 and 11 (Table 3) has been described previously 
(33). 

Fluorescence-based Kinetic Assays—The enzymatic activity 
of 3CLP*° was measured using the following custom-synthe- 
sized peptide: HilyteFluor’-488-ESATLQSGLRKAK-(QXL™™’- 
520)-NH., (AnaSpec, Inc.). The HilyteFluor™™-488  fluo- 
rescence group was internally quenched by QXL™-520 dye. 
This substrate works as a generic peptide substrate for 3CLP’° 
enzymes and was designed based on the nsp4 | nsp5 cleavage 
sequence for many coronavirus 3CLP*° enzymes. The rate of 
enzymatic activity was determined at 25 °C by following the 
increase in fluorescence (Agxcitation = 485 NM, Agmission = 228 
nm, bandwidths = 20 nm) of Hilyte Fluor’™-488 upon peptide 
hydrolysis by the enzyme as a function of time. Assays were 
conducted in black, half-area, 96-well plates (Corning Glass) in 
assay buffer (50 mm HEPES, pH 7.5, 0.1 mg/ml BSA, 0.01% 
Triton X-100, and 2 mm DTT) using a final reaction volume of 
100 pl. The resulting florescence was monitored using a BioTek 
Synergy H1 plate reader. The rate of the reaction in arbitrary 
fluorescence units/s (AFU/s) was determined by measuring the 
initial slope of the progress curves, which were then converted 
to units of micromolars of product produced per min (M/min) 
using experimentally determined values of fluorescence 
“extinction coefficient” as described previously (37). All reac- 
tions were carried out in triplicate. 

Determination of Enzymatic Efficiency—The apparent enzy- 
matic efficiency for each of the 3CLP*° enzymes was determined 
by measuring the rate of enzymatic activity as a function of 
varying substrate concentration in 100-jl reactions. Reactions 
were initiated by the addition of enzyme to the wells of an assay 
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plate containing varying concentrations of substrate. The final 
substrate concentrations varied over a range from 0 to 2 pM. 
The final enzyme concentrations for each 3CL?*° studied were 
as follows: MERS-CoV 3CLP"° at 1 wm, SARS-CoV 3CLP*® at 
100 nm, HKU5-CoV 3CL?”® at 250 nm, and HKU4-CoV 3CLP"® 
at 200 nm. Because 3CLP*° enzymes cannot be saturated with 
this substrate at a substrate concentration that would still allow 
accurate fluorescent measurements without the inner filter 
effect, only the apparent k.,,/K,,, values can be determined from 
the slope of the line that results from a plot of the enzymatic 
activity (y axis), normalized for the total enzyme concentration, 
against the substrate concentration (x axis). 

Influence of Dimerization on the Activity of 3CL?”? Enzymes— 
The dependence of the enzymatic activity on the total enzyme 
concentration was determined using the FRET-based assay 
described above. The final enzyme concentrations were varied 
over a concentration range from 2 uM to 100 nm for MERS-CoV 
3CLP", 500 to 10 nm for SARS-CoV 3CL?P"°, 250 to 0.6 nm for 
HKU5-CoV 3CL?"®, and 200 to 10 nm for HKU4-CoV 3CL?P”®. 
Reactions were initiated by the addition of substrate, at a final 
concentration of 2 jM, to the assay plates containing varying 
enzyme concentrations in the assay buffer. Initial rates were 
determined from the initial slopes of the progress curves at each 
enzyme concentration. 

The rates of the 3CL?"°-catalyzed reactions measured over a 
range of enzyme concentrations can be fit to either Equation 1 
or 2 to determine the values of the dissociation constant for the 
monomer-dimer equilibrium as well as the turnover numbers. 
Nonlinear regression and the program TableCurve 2D version 
4.0 were used to fit the data to either Equation 1 or 2 below (44). 


— Ky + \Ki + 8K4C; 
Kati 4 


Vinax a 


Ky + 4c, — Ki + 8KgC; 
cat,D 8 (Eq. 1) 


In Equation 1, V,,,,,, is the rate of the enzymatic activity calcu- 
lated at each enzyme concentration (C,); K, is the monomer- 
dimer equilibrium dissociation constant, and k..4, 4 and keat, p 
are the turnover numbers for the monomer and the dimer, 
respectively. 


Ky + 4c, ae Ke + 8KiC, 
8 


Vmax = k.a(D] = Kat (Eq. 2) 
In Equation 2, V,,,.,.. C7, and K, have been described previously, 
and k,,, is the turnover number for the dimer only. 

Inhibition Assays—To determine the percent inhibition for 
compounds 1-9, the total concentration of the substrate was 
fixed at 1.0 xm, and the enzymes were fixed at 250 no for SARS- 
CoV 3CLP°, HKU5-CoV 3CLP"°, HKU4-CoV 3CL?**, and at 
500 nm for MERS-CoV 3CL?*"®. DMSO stocks (100) of the 
compounds were diluted a hundred-fold to a final concentra- 
tion of 50 xm in 80 pl of the enzyme solution and incubated for 
20 min. After 20 min, the enzymatic activity was measured as 
initial slope of the progress curve, obtained by initiating the 
reaction with 20 pl of 5 uM substrate. % inhibition was calcu- 
lated using Equation 3. 
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eae pts (ratesample a rateneg) 
% inhibition = | 1 xX 100 


(rate; — rateneg) 
(Eq. 3) 


In Equation 3, rate, mple 
in AFU/s measured in the presence of the compound; rate,,,,, is 
the initial slope measured in the absence of any compound, and 
rate, is the baseline substrate hydrolysis calculated in the 
absence of enzyme. All the reactions were carried out in tripli- 
cate and contained a final DMSO concentration of 1%. For 
compounds displaying more than 50% inhibition, a more exten- 
sive characterization of the inactivation kinetics was performed 
through progress curve analysis. To the reaction well, 20 pl of 5 
LM substrate was added to a final concentration of 1 xm, and 
the total inhibitor concentration [/],...; was varied from 0 to 50 
pm. The reaction was initiated with the addition of 80 pl of 
MERS-CoV 3CL?"® to a final concentration of 500 nm. Fluores- 
cence intensity was then measured over time as AFU, for a 
period of 70 min. Equation 4 describes the resulting time course 
of reaction. 


is the initial slope of the progress curve 


Vv 
[Pl = 7 


“( exp( Kopst) t [P]; 


obs 


(Eq. 4) 


In Equation 4, v; is the initial velocity of the reaction; k,,,, is the 
observed first-order rate constant for the reaction in the 
absence and presence of inhibitor; ft is the time in minutes; [P], 
is the concentration of product produced at time £, and [P], is 
the initial product concentration, which is zero. Product con- 
centrations were calculated from the values of AFU, using the 
experimentally determined fluorescence extinction coefficient. 
The resulting values of [P], were then plotted against time ¢, and 
the data were fit to Equation 4 with [P], = 0 using the nonlinear 
regression program TableCurve 2D to derive the fitted param- 
eters v, and k,,,, and their associated errors Av, and Ak,,,.. 


Values for each k,,,, were then plotted against [J],,.., and the 
data were fit to Equation 5. 
Kinac IN otal 
fe tl! Jtotal (Ea. 5) 


aaa 4 =I [eotat 


In Equation 5, k;,,,-¢ defines the maximum rate of inactivation at 
infinite inhibitor concentration, and K; defines the concentra- 
tion of inhibitor that yields a rate of inactivation equal to 
Vk, nact Lhe half-life of inactivation at infinite inhibitor concen- 
tration, which is a measure of inactivation efficiency, is defined 
as t% = 0.693/Kinact: 

AUC Analysis—To determine the oligomeric state of MERS- 
CoV 3CLP, sedimentation velocity experiments were per- 
formed at 20 °C on the Beckman-Coulter XLA ultracentrifuge 
using varying concentrations of MERS-CoV 3CLP*® (4-23 um) 
in 25 mm HEPES, pH 7.5, 50 mm NaCl, and 1 mM tris(2-car- 
boxyethyl)phosphine at 50,000 rpm. To characterize the effect 
of the ligand on the monomer-dimer equilibrium of MERS- 
CoV 3CLP*°, sedimentation velocity experiments were con- 
ducted on the Beckman-Coulter XLI instrument using different 
stoichiometric ratios of MERS-CoV 3CLP’° with compounds 6 
and 10. Samples were prepared by mixing 25 um MERS-CoV 
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3CLP’° with 25, 50, and 100 uM compound 6 or 10 and incu- 
bating the mixture overnight at 4°C before performing the 
experiments. Absorbance optics (280 nm) and interference 
optics were utilized for protein detection. Solvent density, vis- 
cosity, and partial specific volumes were calculated using 
SEDNTERP. SEDPHAT was used to fit the data to the mono- 
mer-dimer self-association model to estimate the sedimenta- 
tion coefficients (s), apparent molecular weights, and K, and 
Koee Values from size distribution analysis. To obtain exact 
molecular weights, sedimentation equilibrium experiments 
were performed at concentrations of 3 and 17 wm MERS-CoV 
3CLP"°. The experiments were done at 20°C utilizing a two- 
channel centerpiece and run at multiple speeds (8100, 13,800 
and 24,000 rpm) in a AN-60 Ti rotor. 

MERS-CoV 3CL?” Activation and Inhibition by a Noncova- 
lent Inhibitor—The rates of the MERS-CoV 3CLP"°-catalyzed 
reactions were determined at final enzyme concentrations of 
0.5, 1.0, and 2.0 wm and in the absence and presence of varying 
concentrations (0.1-60 um) of compound 10. The substrate 
concentration was fixed at 2.0 um. DMSO stocks (100) of 
compound 10 were diluted a hundred-fold in 80 pl of enzyme 
solution and incubated for 10 min. At the same time, a zero- 
inhibitor control reaction was set up by mixing DMSO to a final 
concentration of 1% into 80 ul of enzyme solution. After 10 
min, the rate of the enzymatic activity was measured as the 
initial slope of the progress curve, obtained by initiating the 
reaction with 20 pl of 10 xm substrate. Equation 6 was utilized 
to calculate the percent activity. 


(rat€sample — FatEeneg) 
% activity = “mee “x 100 
(rate,o, — rateneg) 


(Eq. 6) 


The rate... mple 
Equation 3. 

MERS-CoV 3CL?’? Crystallization, X-ray Data Collection, 
and Structure Determination—Purified MERS-CoV 3CLP"° 
was concentrated to 1.6 mg/ml in 25 mm HEPES, pH 7.5, and 
2.5 mm DTT. Inhibitor complexes of MERS-CoV 3CLP"° with 
compounds 6 and 11 were formed by incubating MERS-CoV 
3CLP"° with the compounds in a 1:3 stoichiometric ratio at 4 °C 
overnight. After iterative rounds of optimization of the crystal- 
lization conditions based on the initial hits obtained from high 
throughput screening of Qiagen Nextel Screens, crystals of 
MERS-CoV 3CL?”° inhibitor complexes suitable for x-ray dif- 
fraction were grown by the hanging-drop, vapor diffusion 
method at 20 °C in 0.2 M sodium acetate, 0.1 M BisTris, pH 7.0, 
and 20% PEG-3350 for the MERS-CoV 3CL?"® and 6 complex, 
and 0.2 M ammonium acetate, 0.1 M BisTris, pH 5.5, 12% PEG- 
3350 for the MERS-CoV 3CL?®° and 11 complex. For x-ray data 
collection, crystals were flash-cooled in liquid nitrogen after 
dragging the crystals through a cryo-solution that con- 
tained the crystallization solution supplemented with 15% 
2-methyl-2,4-pentanediol. 

X-ray diffraction data were collected for MERS-CoV 3CLP*° 
and 6 and MERS-CoV 3CL?*° and 11 complexes at the Lilly 
Research Laboratories Collaborative Access Team (LRL-CAT) 
Sector 31 and the Life Sciences Collaborative Access Team (LS- 
CAT) Sector 21 at the Advanced Photon Source, Argonne 


, rate,,,, and rate,., are as described above for 


pos’ 
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TABLE 1 
Purification summary of MERS-CoV 3CLP"? per liter of E.coli BL21-DE3 cells 
Sample Protein Total activity units Specific activity Fold purification % yield 
mg units/mg 
Lysate 1102 1168 1 1 100 
Phenyl-Sepharose 219 185 1 ul 16 
DEAE 22 189 8 8 16 
Mono S 15 142 9 9 12 
Superdex 75 12 114 10 10 10 
A. B. 
— 0:30) -¢ MERS-Cov 
'S 0.254 -& SARS-CoV 
— -@ HKU5-CoV 3CLP° Enzyme | *Keat/Ky (x 102) (UM™.min”) 
re 0.20 
2 iicaiiinal MERS-CoV 3.1 £0.03 
Bp 0-15 SARS-CoV 15.54 0.9 
i, 0.10 HKU5-CoV 8.8 +0.1 
3 HKU4-CoV 11.3+0.4 


0.0 0.5 1.0 1.5 2.0 
[Substrate] uM 


FIGURE 1. Comparison of enzymatic efficiencies (k.,./K,,,) of 3CL°"° enzymes from different CoVs. A, rates for the enzymatic activity, normalized to the total 
enzyme concentration, are plotted as a function of varying substrate concentrations. Total concentration of each enzyme in the final reaction is as follows: 
MERS-CoV 3CLP"° at 1 pxm; SARS-CoV 3CLP"° at 100 nm; HKU5-CoV 3CL""° at 250 nm; and HKU4-CoV 3CLP"? at 200 no. Slope of the line represents the apparent 
value of k../K,,. Error bars represent the standard deviation for triplicate data. B, *, apparent value of k.,,/K,,, for the nonsaturable substrate, calculated as the 


slope of the linear plot from panel A. 


National Laboratory, respectively. Data were processed and 
scaled using Mosflm version 7.0.5 (45) and HKL2000 version 
706 (46). The method of molecular replacement was used to 
obtain initial phases using the program PHASER-MR in Phenix 
suite version 1.8.4 (47). For MERS-CoV 3CL?"® and 6 complex, 
the x-ray structure of SARS-CoV 3CLP*° (PDB code 3V3M) was 
used as a phasing model (32). The final MERS-CoV 3CLP"® and 
6 complex structure was then used to calculate the initial 
phases for the MERS-CoV 3CLP’° and 11 complex model. 
Automated model building using Autobuild in Phenix was ini- 
tially used to build a preliminary model of the MERS-CoV 
3CLP*° and 6 inhibitor complex. Each structure was then 
refined using iterative cycles of refinement using Phenix Refine 
coupled to manual model building using COOT (48) based on 
F, — F,and 2F, — F, maps. Coordinates and molecular library 
files for inhibitor molecules were built using the program 
eLBOW in the Phenix suite. Water molecules were added to 
peaks in residual (F,, — F.) density maps that were greater than 
30 using the “Find Water” function in COOT. MolProbity was 
used to assess structural quality of the final model (49). The 
measured structure factor amplitudes and the atomic coordi- 
nates for the final structures were deposited in the Protein Data 
Bank with accession codes 4RSP (MERS-CoV 3CL?*® and 6 
complex) and 4YLU (MERS-CoV 3CLP’° and 11 complex), 
respectively. Structural superposition was performed using the 
method of least squares fitting of C-a atoms in COOT. PyYMOL 
was used to generate figures of all the structures (50). 


Results 

Production of MERS-CoV 3CL?’? with Authentic N and C 
Termini—Insertion of the nsp4 | nsp5 cleavage site between 
the N-terminal His, tag and the coding region for MERS-CoV 
3CLP"° results in autoprocessing of the His tag and overexpres- 
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sion of MERS-CoV 3CL?"® without any N-terminal extension in 
E. coli BL21-DE3 cells. MERS-CoV 3CL?"® was purified to high 
purity and an overall yield of 10% using four sequential chro- 
matographic steps. A summary of the percent enzyme yield, 
total activity units, and the fold-purification after each chro- 
matographic step is summarized in Table 1. Approximately 12 
mg of highly pure MERS-CoV 3CL?"° can be obtained per liter 
of bacterial cell culture. 

To verify the production of the enzyme with correct N and C 
termini, the molecular mass of purified MERS-CoV 3CL?”° was 
determined by MALDI to be 33.4 kDa, which is close to the 
theoretical molecular mass of 33.3 kDa for the authentic/ma- 
ture MERS-CoV 3CL?*° monomer. Western blot analysis of 
purified MERS-CoV 3CLP° using an anti-His, antibody also 
confirmed the absence of the N terminus His, tag associated 
with the expression plasmid (data not shown). These results 
demonstrate that the N-terminal His, tag is auto-catalytically 
removed by MERS-CoV 3CL?"° during its expression in E. coli, 
indicating MERS-CoV 3CLP*® is enzymatically active when 
expressed in E. coli. 

MERS-CoV 3CL?’’ Hydrolyzes a Fluorescent Peptide Sub- 
strate with Lower Efficiency than Other 3CL? Enzymes—A 
FRET-based peptide substrate was used to measure the enzy- 
matic activity of MERS-CoV 3CLP"° as a function of substrate 
concentration over a substrate concentration range from 0 to 
2.0 uo (Fig. 1A). We observed that MERS-CoV 3CLP*° cannot 
be saturated by the substrate over this concentration range, 
which is typical for other coronavirus 3CLP*° enzymes because 
the K,,, values for peptide substrates approach 1 mm (51-54). 
Therefore, the slope of the kinetic response of MERS-CoV 
3CL?”° to increasing substrate concentration was determined 
to derive an apparent (k.,,/K,,,) value, which is a measure of 


cat 


JOURNAL OF BIOLOGICAL CHEMISTRY 19407 


SLOT ‘OT IsNSNy UO YSingsyig Jo AjIsIOATUA ye /SIO‘Og! MMM//:dyYy Wo popeopumog 


Ligand-induced Dimerization Regulates MERS-CoV 3CL?"° 


A. 
a8 -® MERS-CoV 


-& SARS-CoV 
-@ HKU5-CoV 
—~- HKU4-CoV 


0.08 


0.06 


0.04 


Rate (uM.min‘') 


0.02 
0.00 


0.0 0.25 0.5 1.0 1.5 2.0 
[3CL°°] pM 


0.05 
0.04 
0.03 


0.02 


Rate (uM.min‘') 


0.01 


0.00 
0.00 0.05 010 0.15 0.20 0.25 
[3CL°°] uM 


FIGURE 2. Dependence of the enzymatic activity of MERS-CoV, HKU4-CoV, 
HKU5-CoV, and SARS-CoV 3CL""° on the total enzyme concentration. A, 
kinetic response of each CoV 3CL"° to increasing enzyme concentration is plot- 
ted along with the resulting fit of the data to Equation 2. Resulting values for the 
apparent turnover number, k.,,, and the monomer-dimer equilibrium constant, 
K,, are shown in Table 2. Final enzyme concentrations varied over the concentra- 
tion ranges of 2 4m to 100 nm for MERS-CoV 3CL"°, 500 to 10 nm for SARS-CoV 
3CLP"°, 250 to 0.6 nm for HKU5-CoV 3CLP"?, and 200 to 10 nm for HKU4-CoV 3CL?'°. 
Final substrate concentration was fixed at 2 zm. Experiments were done in tripli- 
cate. Error bars represent the standard deviation for triplicate data. Shaded box 
represents the data that are plotted in B. B, enlarged view of the fitted data at low 
total enzyme concentrations, marked in shaded box in, illustrating the nonlinear 
dependence of enzymatic activity on the total concentrations of 3CL°"° from 
SARS-CoV, HKU5-CoV, and HKU4-CoV. 


enzymatic efficiency. We also determined and compared the 
apparent (k.,,/K,,,) values for 3CL?”° enzymes from SARS-CoV, 
HKU5-CoV, and HKU4-CoV under similar experimental con- 
ditions (Fig. 1B). MERS-CoV 3CLP"® is able to hydrolyze the 
peptide substrate; however, the enzymatic efficiency of MERS- 
CoV 3CLP*° (k..4/K,, = 3.1 + 0.03 X 10°? um! min~*) is 
noticeably lower than other 3CLP"° enzymes tested. Specifi- 
cally, MERS-CoV 3CLP"° was 5-fold less efficient at processing 
the peptide substrate when compared with SARS-CoV 3CLP"®. 
Even among the B-CoVs from the same 2c genogroup (MERS, 
HKU5, and HKU4), MERS-CoV 3CLP*° was the least efficient 
enzyme. 

MERS-CoV 3CL?”? Is a Weakly Associated Dimer—Because a 
dimer has consistently been shown to be the catalytically active 
form of all 3CLP*° enzymes studied to date, we tested the 
hypothesis that the lower enzymatic efficiency of MERS-CoV 
3CLP*° is a result of the reduction in its ability to dimerize. 
Therefore, we determined the dependence of the enzymatic 
activity of MERS-CoV 3CL?*° on the total enzyme concentra- 
tion and compared it with other 3CLP*° enzymes from HKU4, 
HKU5, and SARS coronaviruses (Fig. 2). 
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TABLE 2 


Comparison of the apparent turnover number, k,,,, and the monomer- 


dimer dissociation constant, K,, for 3CLP"° from different CoVs 
Nonlinear fitting of kinetic data* 


3CLPre Kear? Ky 
min! LM 
MERS-CoV 0.2 = 0.02 78+ 13 
SARS-CoV 0.47 + 0.03 0.06 + 0.01 
HKU5-CoV 0.53 + 0.02 0.06 + 0.01 
HKU4-CoV 0.84 + 0.07 0.1 + 0.03 


@ Values were determined through nonlinear fitting of the kinetic data to Equation 
2. 
> ka represents the apparent turnover number. 


It is immediately apparent from the data plotted in Fig. 2 that 
the response of MERS-CoV 3CLP’® enzymatic activity to an 
increasing enzyme concentration is nonlinear. The strong cur- 
vature suggests that a dimer is either the most active form or the 
only active form of MERS-CoV 3CL?"°. To determine the 
mechanism of dimerization, the data in Fig. 2 were first fit to 
Equation 1 (see “Experimental Procedures”), which describes a 
model where both the monomer and the dimer are active. A fit 
of the data to Equation 1 yielded a negative turnover value for 
the monomer (K,.,, ag), Suggesting the monomer is inactive and 
that the dimer is the only active form of the enzyme. Therefore, 
the data were fit to Equation 2 (see “Experimental Procedures”), 
which considers only the dimer as the active form of the 
enzyme. The kinetic data for all four 3CLP*° enzymes, MERS- 
CoV, HKU4-CoV, HKU5-CoV, and SARS-CoV, fit well to this 
model, and the resulting values for the monomer-dimer equi- 
librium dissociation constant, K, and apparent turnover num- 
ber, k.,,, for each enzyme are provided in Table 2. 

The lower k,, value for MERS-CoV 3CL?*°, when compared 
with other coronavirus 3CLP*° enzymes, indicates a moderate 
reduction (2—4-fold) in its ability to turn over the substrate, 
which is consistent with the observed lower apparent (k,,,,/K,,,) 
value. In contrast, there is a substantial reduction in the ability 
of MERS-CoV 3CLP"® to dimerize compared with the other 
3CLP*° enzymes. Based on the K,, values, the capacity of MERS- 
CoV 3CLP"® to dimerize is ~78 —130-fold weaker than the other 
enzymes (Table 2). These results indicate that the MERS-CoV 
3CLP"° dimer is much more weakly associated than the other 
coronavirus 3CLP’° enzymes studied, and these results raise 
questions as to the structural and mechanistic differences 
among the 3CLP"° enzymes that ultimately regulate protease 
activity during coronavirus replication. 

MERS-CoV 3CL””? Inhibition by Designed Peptidomimetic 
Compounds—In an effort to develop potent inhibitors of 
MERS-CoV 3CL?”®, we designed and synthesized nine peptido- 
mimetic compounds containing a Michael acceptor group, ie. 
an a,B-unsaturated carbonyl, capable of irreversibly reacting 
with the active site cysteine of MERS-CoV 3CLP*° (Table 3). 
These compounds were designed and synthesized based on our 
understanding and knowledge of the interactions of similar 
inhibitor molecules with SARS-CoV 3CL?*? (30, 31). At a con- 
centration of 50 um, compounds 6-9 displayed more than 50% 
inhibition of MERS-CoV 3CLP*° and were further evaluated 
for their ability to inactivate the enzyme in a time- and 
concentration-dependent manner (Fig. 3). Data from the 
kinetic progress curve for compound 6 (Fig. 3), as well as for 
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TABLE 3 
Chemical structures and inhibitory activity of compounds 1 to 11 against MERS-CoV 3CLP’° 


The Michael acceptor group for compound 1 is shaded to highlight this group for all the compounds. The stereochemistry at the benzyl stereocenter of compound 5 isa 1:1 
mixture of enantiomers (racemic); therefore, the compound was tested as a mixture of diastereomers. 


Peptidomimetic compounds with Michael-acceptor groups Non-covalent 
peptidomimetics 


Cmpd % Inhi © he ie K;‘ Cmpd ice” 
in 46 nd nd nd 10 >100 
2° 11 nd nd nd 11 >100 
a" 21 nd nd nd 
4° 0 nd nd nd 
5 0 nd nd nd 

6 99 0.81+0.08 0.86+ 0.08 3.6+0.8 
7 100 0.84+0.05 0.83+0.05 4.7+0.6 
8 100 L122020 06220.1] 90+2.3 
9 100 1.13+40.20 0.61+0.11 99+26 


*% inhibition was measured as the % loss in enzymatic activity after 20 min of incubation of 500 nm MERS-CoV 3CLP"® with 50 jum of the compound. 

“ As compounds 1—5 showed <50% inhibition of MERS-CoV 3CLP”°, values of Kinacts 1/2” and K; were not determined (nd) for these compounds. 

® Kinact iS X10~3 s~}, 

“ty” is X 10° s. 

@ Kis in uM. 

© ICs values for compounds 10 and 11 were calculated from a dose- response curve determined after 10 min of incubation of 1 4m MERS-CoV 3CL?° with varying concen- 
trations of compounds. IC;o is in uM. 
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compounds 7-9 (data not shown), were fit to the appropriate 
equations (see under “Experimental Procedures”) to obtain the 
kinetic parameters, k,,,,., ¢%, and K,, and the resulting values are 
provided in Table 3. 

We identified four compounds, 6-9, as micromolar inhibi- 
tors of MERS-CoV 3CL?”° with K; values less than 10 um (Table 
3). Analysis of structure-activity relationships of these com- 
pounds suggests that the S, subsite pocket of MERS-CoV 
3CLP*° is small and can only accommodate a smaller P,-isobu- 
tyl substituent (compounds 6-9) but not bigger substituents 
such as P,-benzyl or P,-isobutylenyl (compounds 1-5). It was 
also observed that replacing the P,-ethoxy (compound 6) with 
P,-isopropoxy (compounds 7 and 8) had no effect on the inhib- 
itory activity of the compounds. Finally, these compounds pro- 
vide an excellent chemical scaffold to study the molecular 
details of interactions of substrate-like compounds with the 
enzyme and to develop more potent inhibitors of MERS-CoV 
3CL?”° for therapeutic intervention. 

To evaluate broad spectrum specificity of these compounds, 
we also calculated % inhibition of SARS-CoV 3CL?"°, HKU5- 
CoV 3CLP®, and HKU4-CoV 3CL?”® after 20 min of incubation 
in the presence of 50 uM compounds 6-9. Except for com- 
pound 9, which inhibited SARS-CoV 3CL?*° by 76%, we 
observed 100% inhibition of all other enzymes in the presence 
of compounds 6-9. Furthermore, we performed progress curve 
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0.45 Compound 6 °%S-nx 


— 50 
0.40 H ? 4 4 saat 
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“ Background 
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FIGURE 3. Progress curves for the MERS-CoV 3CL°"°-catalyzed reaction in 
the presence of compound 6. Time-dependent hydrolysis of 1 wm substrate 
catalyzed by 500 nm MERS-CoV 3CL°"° was measured over a time period of 70 
min and at fixed variable concentrations of compound 6 ranging from 0 to 50 
po. Values for the inactivation kinetic parameters k;,.¢¢, ti72, and K, were cal- 
culated by fitting the progress curve data to Equations 4 and 5. Chemical 
structure of compound 6 is shown in the inset. 


analysis of HKU5-CoV 3CLP*° and HKU4-CoV 3CL?"® in the 
presence of varying concentrations of compounds 6-9. The K; 
values of compounds 6-9 for HKU5-CoV 3CL?”® are 0.49 + 
0.16, 0.60 + 0.21, 1.30 + 0.53, and 0.47 + 0.06 jum, respectively. 
The K; values of compounds 6-9 for HKU4-CoV 3CL?"® are 
0.39 + 0.14, 0.50 + 0.17, 0.85 + 0.33, and 0.64 + 0.25 uM, 
respectively. These data suggest that peptidomimetic com- 
pounds 6-9 have the potential to be developed as coronavirus 
3CL?”° inhibitors with broad spectrum specificity. 

Weak Association of the MERS-CoV 3CL””° Dimer Is Sup- 
ported by AUC Studies—To further explore the mechanism of 
MERS-CoV 3CLP*® dimerization, we performed analytical 
ultracentrifugation sedimentation velocity (AUC-SV) studies 
at varying concentrations of MERS-CoV 3CLP° (Fig. 4A). 
Unlike enzyme kinetics, AUC allows determination of the 
monomer-dimer equilibrium constant (K,) in the absence of 
substrate. MERS-CoV 3CLP"® displayed a continuous size dis- 
tribution at different protein concentrations. Two distinct 
peaks corresponding to monomer (2.9 S) and dimer (3.9 S) spe- 
cies are observed, with the dimer peak becoming more pro- 
nounced at higher enzyme concentrations (Fig. 4A). We fit the 
AUC data to a monomer-dimer equilibrium model to deter- 
mine the values for K, and k,, where K, is the equilibrium 
dissociation constant for a monomer from the dimer, and k,¢, is 
the rate constant for dissociation of the monomer from the 
dimer. The resulting best fit value for K, is 52 + 5 wm and that 
for kjpis 10° *s~*. The K,, value of 52 wm for MERS 3CL?"® is 
dramatically different from SARS-CoV 3CLP", which has 
reported K, values ranging from low nanomolar up to 10 um 
depending on the enzyme construct used and the experimental 
conditions and methods utilized to determine the dissociation 
constant (37). The dimer affinity of MERS-CoV 3CL?”® is sub- 
stantially weaker than that for SARS-CoV 3CL?"°, when com- 
paring the same enzyme construct, i.e. the enzyme without any 
N- or C-terminal modifications. The AUC-SV calculated K, 
value for MERS-CoV 3CLP"® is ~ 150,000 times higher than the 
value of 0.35 nm determined for SARS-CoV 3CLP’? (34). 

The AUC results (Fig. 44) show that the monomer peak at 
~2.9§ does not gradually shift peak position toward the dimer 
peak at ~3.9S with increasing concentrations of MERS-CoV 
3CLP"®; rather, the two peaks change in area, which is indicative 
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FIGURE 4. AUC-SV analyses of ligand-induced dimerization of MERS-CoV 3CL?"°. A, sedimentation coefficient distribution for varying concentrations of 
MERS-CoV 3CLP'° (4.1 to 23 um) with sedimentation coefficient values of 2.95 and 3.9S for the monomer and the dimer, respectively. The best fit value for 
AUC-SV-calculated K, is 52 + 5 uo. B, sedimentation coefficient distribution of MERS-CoV 3CLP'° (25 yum) in the presence of different stoichiometric ratios of 
compound 6 (25, 50, and 100 um). C, sedimentation coefficient distribution of MERS-CoV 3CL"’° (25 jm) in the presence of different stoichiometric ratios of 
compound 10 (25, 50, and 100 ym). A significant shift in the 2.95 peak (monomer) to a 4.1S peak (dimer) is detected upon addition of increasing concentrations 
of compounds 6 and 10. 
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of very slow monomer-dimer exchange rate (kj ~10 * s_') 
and the formation of hydrodynamically stable monomer and 
dimer species (55). This k,,- value is 1000 times slower than the 
kee Value (107! s~*) reported for SARS-CoV 3CL?”° indicating 
that the SARS-CoV enzyme hasasignificantly more rapid mono- 
mer-dimer exchange rate (56). These observations support a 
model whereby the MERS-CoV 3CLP*° dimer is weakly associ- 
ated, suggesting the enzyme exists mainly as a monomer in 
solution. 

MERS-CoV 3CL?’° Undergoes Extensive Ligand-induced 
Dimerization—The weak association of MERS-CoV 3CLP*° 
monomers engenders the following questions. “Are higher lev- 
els of expression of 3CLP"° in MERS-CoV-infected cells neces- 
sary to allow formation of active dimer?” “Are other mecha- 
nisms such as substrate- or ligand-induced dimerizations 
involved in activating 3CLP"°?” To explore the latter question of 
ligand-induced dimerization of MERS-CoV 3CLP"°, we per- 
formed AUC experiments in the presence of compound 6, 
which acts as a substrate mimetic and mechanism-based inhib- 
itor, also known as a suicide substrate. Peptidomimetic com- 
pounds such as compound 6, which contains a Michael accep- 
tor group, interact and react with the active site cysteine of 
cysteine proteases to covalently modify them. We utilized com- 
pound 6 to form a covalent MERS-CoV 3CL?”® and inhibitor 6 
complex that is stable over long periods of time, making it ame- 
nable to analysis by AUC-SV experiments. In contrast, incuba- 
tion of a normal peptide substrate with the enzyme would lead 
to immediate hydrolysis of the substrate and dissociation of the 
products from the enzyme, confounding AUC experiments and 
subsequent data analysis. 

MERS-CoV 3CL?*° was incubated with varying concentra- 
tions of compound 6 in stoichiometric ratios of 1:1, 1:2, and 1:4. 
The modified enzyme was then subjected to AUC studies to 
determine the influence of compound 6 on the mono- 
mer-dimer equilibrium (Fig. 4B). A significant shift in the area 
under 2.9S peak (monomer) to 4.1S peak (dimer) is detected 
upon addition of increasing concentrations of compound 6. We 
obtained similar results when AUC studies were performed uti- 
lizing a complex of MERS-CoV 3CL?"® with a noncovalent pep- 
tidomimetic inhibitor (compound 10, Figs. 4C). The transition 
of MERS-CoV 3CLP”° from monomer to dimer in the presence 
of compounds 6 and 10 suggests that the enzyme undergoes 
extensive dimerization upon substrate binding. 

MERS-CoV 3CL?”° Is Activated by Ligand-induced Dim- 
erization—The observed ligand-induced dimerization of 
MERS-CoV 3CLP", as demonstrated through AUC studies, 
prompted us to investigate whether or not the enzymatic activ- 
ity of MERS-CoV 3CL?"® could be increased at low concentra- 
tions of acompound via ligand-induced dimerization. To do so, 
we chose to use a noncovalent peptidomimetic compound 
(compound 10, Fig. 5A) that we previously identified as an 
inhibitor of SARS-CoV 3CL?”°. Because of the time-dependent, 
irreversible nature of the reaction between compound 6 and 
MERS-CoV 3CL?"®, use of compound 6 was not ideal for these 
kinetic studies as it would further complicate kinetic data 
analysis. 

The kinetic response of MERS-CoV 3CLP*° to increasing 
concentrations of compound 10 was first measured at a single 
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enzyme concentration of 1.0 wm (Fig. 5A). Interestingly, an 
increase in the activity of MERS-CoV 3CL?"®, as high as 195%, 
was observed in the presence of low inhibitor concentrations 
(0.1 to 20 um). Inhibition of enzymatic activity was observed 
only at higher inhibitor concentrations (40 pM or greater). 
These results suggest that at low concentrations, compound 10 
binds to amonomer and induces the formation of a dimer. The 
resulting dimer then has one free active site that is capable of 
processing the substrate. At higher concentrations of inhibitor, 
the substrate and inhibitor directly compete for the free active 
site. 

The model of activation and inhibition suggested by the data 
at 1 uM enzyme would predict that at higher enzyme concen- 
trations less activation by a compound would be observed at 
lower inhibitor concentrations, and the inhibition of activity 
would be detected at lower inhibitor concentrations because 
the equilibrium would be pushed toward dimer formation. In 
contrast, lower enzyme concentrations would result in 
higher activation by compounds, and inhibition by the com- 
pound would occur at significantly higher compound con- 
centrations. Therefore, we further measured the activity of 
MERS-CoV 3CL?"® at two additional enzyme concentrations 
(0.5 and 2.0 jum) in the presence of varying concentrations of 
compound 10. Remarkably, we observed that the activation 
effect was most pronounced at the lowest MERS-CoV 3CL?"® 
concentration tested (0.5 wm), and the effect decreased as the 
enzyme concentration was increased (1.0 and 2.0 um) (Fig. 
5A). Moreover, inhibition by compound 10 occurred at 
lower compound concentrations when higher concentra- 
tions of enzyme were used. These observations further sup- 
port a model whereby enzyme activation can occur through 
ligand-induced dimerization. 

The activation and inhibition of MERS-CoV 3CL?*° by com- 
pound 10 can be explained by a simple kinetic model depicted 
in Fig. 5B. The MERS-CoV 3CLP"® monomer exists in equilib- 
rium with the dimer, and their relative concentrations depend 
on the total enzyme concentration. In the absence of substrate 
or compound, the K, value is 52 ym, and the equilibrium is 
represented by the gray spheres (blue box) in Fig. 5B. The mono- 
mer is unable to hydrolyze the substrate and is therefore inac- 
tive. Binding of inhibitor (Fig. 5B, green triangle) to the mono- 
mer results in monomer to dimer switch leading to the 
formation of a dimer that contains inhibitor bound in one of 
the active sites. Once the dimer is formed, the substrate binds in 
the second active site and catalysis takes place. Under high 
inhibitor concentrations, however, the inhibitor molecule 
directly competes with substrate for the free dimer active site, 
and inhibition of the enzymatic activity is observed as a result. 

We would also expect to observe induced dimerization and 
activation in the presence of the substrate. Indeed, the mono- 
mer-dimer kinetic studies performed in Fig. 2 were performed 
at a fixed concentration of substrate at 2 jm. In this experiment, 
the K, value for the MERS-CoV 3CLP’° dimer was determined 
to be 7.8 «xm, which is lower than the K,, value determined in the 
absence of substrate using AUC, thereby supporting substrate- 
induced dimerization. Given the high K,,, value of 3CL?”® for the 
peptide substrate (51-54), even higher substrate concentra- 
tions would be required to observe substrate activation in a plot 


JOURNAL OF BIOLOGICAL CHEMISTRY 19411 


SIOZ ‘OT IsNSny UO YSingsyig Jo AjIsIOATUA ye /SIO‘Og! MMmM//:dyy Wo popeojumog 


Ligand-induced Dimerization Regulates MERS-CoV 3CL?"° 


A 250 


—e 0.5 uM MERS-CoV 3CL"° 
-- 1.0 uM MERS-CoV 3CL?°° 


rm 200 + 2.0 pM MERS-CoV 3CL?° 
= 
2 s 
_ 
© 150 . | 
x fe) Ney 
BE Gogomoonone:) Janonegenon <qREK Goon eeenneee Compound 10 
50 
0 20 40 60 
[Compound 10] pM 
B Absence of Ligan A resence of Ligands 
monomer monomer with 
bound ligand 
Inhibition 
| , A) = Ala 
N ———— 
, _ dimer 


k 


Substrate 


Activation 


FIGURE 5. Activation of MERS-CoV 3CLP"° via ligand-induced dimerization. A, enzymatic activity of 0.5, 1.0, and 2.0 wm MERS-CoV 3CLP"° was measured in 
the absence and presence of varying concentrations of compound 10. Substrate concentration was fixed at 2.0 ym. % activity, normalized to zero inhibitor 
enzymatic activity, was plotted as a function of increasing inhibitor concentrations. Error bars represent the standard deviation for triplicate data. Increase in 
enzymatic activity (highlighted in cyan-shaded box) is observed in the presence of low concentrations of compound 10. Inhibition of enzymatic activity is 
observed at higher inhibitor concentrations (highlighted in yellow-shaded box). B, kinetic model describing the equilibrium between different species of 
MERS-CoV 3CLP"° that are formed in the absence (blue box) and presence (green box) of a ligand is shown. Based on the AUC-calculated K, value of ~ 52 um, 
MERS-CoV 3CLP'° primarily exists as a monomer in solution in the absence of a ligand. Upon ligand binding (inhibitor / in our case) to the monomer, the 
monomer-dimer equilibrium shifts toward dimer formation. Next, under lower inhibitor concentrations (cyan-shaded box), substrate (S) binds in the second 
active site and catalysis takes place. However, under higher inhibitor concentrations (yellow-shaded box), inhibitor directly competes with the substrate for the 


second active site, and inhibition of the enzymatic activity is observed. 


of catalytic activity versus substrate concentration. However, 
we are limited to use our FRET-based substrate only at low 
concentrations due to a significant inner filter effect at higher 
concentrations of substrate. Therefore, a compound that both 
mimics substrate and has higher binding affinity can act as a 
useful surrogate for the substrate, allowing the observation of 
ligand-induced dimerization and activation even at low sub- 
strate concentrations. 

X-ray Structure of MERS-CoV 3CL?”? in Complex with Com- 
pound 6—To gain atomic level detail and molecular insight into 
the mechanism for substrate-induced dimerization of MERS- 
CoV 3CLP*°, we attempted to crystallize and determine the 
x-ray structures of the unliganded MERS-CoV 3CL?*° mono- 
mer and the MERS-CoV 3CL?P*® covalently modified with com- 
pound 6. Unfortunately, we were unable to crystallize the unli- 
ganded MERS-CoV 3CL?*° monomer after multiple attempts, 
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but we were able to crystallize and determine the x-ray struc- 
ture of MERS-CoV 3CL?*° in complex with compound 6 to a 
resolution of 1.6 A. The statistics for x-ray data collection, pro- 
cessing, and refinement are summarized in Table 4. The MERS- 
CoV 3CLP"° and 6 complex crystallized as a biologically rele- 
vant, symmetrical dimer in space group C2 with one monomer 
in the asymmetric unit. Electron density for the entire protein 
was Clearly visible and strong electron density (F, — F, >40) 
was present for compound 6 within the active site (Fig. 6A). 
MERS-CoV 3CL?"? Has a Smaller S, Pocket than SARS-CoV 
3CL?’°—The active site of MERS-CoV 3CL?*° bound with com- 
pound 6 is shown in Fig. 6, A and B. Compound 6 is covalently 
bound to the active site cysteine (Cys-148) via a 1.8 A bond 
between the y-sulfur and the electrophilic B-carbon of the 
Michael acceptor. The P’ ,-ethyl ester carbonyl, which mimics 
the carbonyl of the scissile bond in a substrate, forms a hydro- 
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TABLE 4 
X-ray data collection and refinement statistics 


MERS-CoV 3CL?*°6 


MERS-CoV 3CL?™™11 


Beamline LRL-CAT sector 31 ID-D LS-CAT sector 21 ID-G 
Data collection 
Wavelength (A) 0.9793 0.9786 
Resolution range (A) 19.35-1.62 (1.68—1.62)% 50.00—2.10 (2.14—2.10)* 
Protein monomers in asymmetric unit 1 4 
Space group C2 P2, 
Unit cell dimensions 
a, b,¢ (A) 106.49, 57.31, 48.88 63.44, 114.93, 92.34 
a, B, y (°) 90, 112.78, 90 90, 90.89, 90 
Total no. of reflections 63,855 816,216 
No. of unique reflections 32,851 76,865 
Multiplicity 1.9 (1.9)* 2.2 (2.2)* 
Completeness (%) 95.0 (93.8)* 96.8 (93.8)% 
Mean J/ol 5.2. (1,3)* 11.17 (1.83)* 
Rirerge (%))” 8.3 (67.2)* 8.8 (58.6)* 
Refinement 
Resolution range (A) 19.35-1.62 42.59-2.10 
No. of reflections in working set 30824 76623 
No. of reflections in test set 2026 2019 
Rerorte (%)° 17.8 15.91 
Riveo (%)° 21.7 21.51 
No. of non-hydrogen atoms 
Protein/water 2380/208 9383/995 
r.m.s.d.,” bond lengths (A) 0.007 0.013 
r.m.s.d., bond angles (°) 1.09 1.35 
Ramachandran favored (%) 99 98 
Ramachandran outliers (%) 0 0 
Molprobity clash score 3.3 1.94 
Average B-factor (A?) 20.4 33.1 
Protein 19.8 32.5 
Ligands 16.6 41.1 
Solvent 27.7 37.9 
“Values in parentheses are for highest resolution shell. 
” Rinerge = Lrilli(h) — ((h))\/=,,2 (A), where 1,(h) is the ith measurement and (/(/)) is the weighted mean of all measurements of I(h). 


© Ryork aNd Rees = h(\F(L),| — | F(A),|)/h | F(),| for reflections in the working and test sets, respectively. 
d 


r.m.s.d. is root mean square deviation. 


gen bond with the backbone NH of Gly-146 that forms part of 
the oxyanion hole (Fig. 6B). Within the S, subsite, the P,-lactam 
carbonyl, which is a surrogate for the amide of P,-glutamine of 
substrates, participates in a hydrogen bonding interaction with 
the imidazole ring of His-166, and the P,-lactam NH forms a 
hydrogen bond with the carboxylate oxygen of Glu-169. The 
P,-backbone amide NH forms a hydrogen bond with the side 
chain carbonyl of Gln-192 (Fig. 6B). The P,-leucine side chain 
atoms of the inhibitor make hydrophobic contacts with the side 
chains of Met-168 and Leu-49 that line the S, subsite pocket. 
Moreover, compared with the equivalent residue Thr-25 in 
SARS-CoV 3CLP*°, Met-25 in the S, pocket of MERS-CoV 
3CLP*° is expected to reduce the size of the hydrophobic 
pocket, which is supported by our observed SAR described 
above. 

The smaller size of the S, pocket in MERS-CoV 3CL?”® is also 
consistent with the preference for a smaller leucine residue at 
the P, position of cleavage sites instead of a bulkier phenylala- 
nine or methionine residue. Indeed, analysis of the preference 
for leucine or phenylalanine at the P, position for the 11 3CL?”° 
cleavage sites within the polyprotein of MERS-CoV shows that 
none of the 11 cleavage sites contain a phenylalanine residue at 
this position (Fig. 6C). Leucine is the predominantly favored 
residue at this position followed by methionine. Analysis of the 
cleavage sites from SARS-CoV, HKU4-CoV, and HKU5-CoV 
shows that none of the 11 cleavage sites from group 2c mem- 
bers (MERS-CoV, HKU4-CoV, and HKU5-CoV) contain a phe- 
nylalanine residue at the P. position; however, the SARS-CoV 
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nsp5 | nsp6 cleavage site contains a phenylalanine residue at 
this position. 

Other interactions are also observed to play a significant role 
in stabilizing the MERS-CoV 3CLP"°-compound 6 complex. 
The P,-carbonyl and P-NH participate in hydrogen bonding 
interactions with the backbone NH and carbonyl of Glu-169. 
The P,-serine side chain is within hydrogen bonding distance of 
the side chain carboxamide of Gln-195 and the backbone car- 
bony of Lys-191. 

X-ray Structure of MERS-CoV 3CL”” in Complex with a Non- 
covalent Inhibitor—We were also able to obtain diffraction 
quality crystals of MERS-CoV 3CL?"® in complex with com- 
pound 11, which has an almost identical chemical structure as 
that of compound 10 (Fig. 6D). We previously showed that 
compounds similar to 10 and 11 act as potent noncovalent 
inhibitors of 3CL?*° from SARS-CoV (33). The x-ray structure 
of compound 11 bound to MERS-CoV 3CL?"® was determined 
to a resolution of 2.1 A and the x-ray data collection, processing, 
and refinement statistics are summarized in Table 4. The 
MERS-CoV 3CL?"® and 11 complex crystallized in space group 
P2, with two biologically relevant dimers in the asymmetric 
unit. The overall root mean square deviation between the C-a 
atoms of the four chains was less than 1 A, with the highest C-a 
root mean square deviation of 0.719 A between chains C and D. 
Strong electron density (F, — F, >40) was present for com- 
pound 11 within all the four active sites of the two dimers (Fig. 
6D). 
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FIGURE 6. X-ray crystal structure of MERS-CoV 3CL?"° in complex with inhibitors. A, solvent-accessible surface (gray-shaded surface) of MERS-CoV 3CL°"° 
and compound 6 complex. Compound 6 is displayed in ball and stick model with atoms colored as follows: carbons (orange), nitrogens (blue), and oxygens (red). 
Electron density associated with compound 6 is shown as an F,, — F_ electron density difference map contoured to 3a (green mesh). Substrate binding pockets 
S,-S', are labeled, where asterisk indicates the electrophilic carbon of compound 6 that forms a C-S covalent bond with the active site cysteine Cys-148. B, 
MERS-CoV 3CLP° and compound 6 complex with the MERS-CoV 3CLP"° backbone represented as a ribbon model and relevant amino acids that interact with 
compound 6 represented as ball and sticks. MERS-CoV 3CLP'° carbon atoms are colored blue, and compound 6 carbon atoms are colored orange. Nitrogen 
atoms are colored blue, and oxygen atoms are colored red. Catalytic residues Cys-148 and His-41 are also shown. Hydrogen bonds are depicted as red dashed 
lines. C, sequence logos showing amino acid conservation for the 11 polyprotein cleavage sites of different 3CL°’° enzymes (MERS-CoV, HKU5-CoV, HKU4-CoV, 
and SARS-CoV), generated using the WebLogo server (63). Residues P,-P’, are shown. Height of each letter corresponds to the amino acid conservation at that 
position. D, solvent-accessible surface (gray-shaded surface) of MERS-CoV 3CL°"° and compound 11 complex. Compound 11 is displayed in ball and stick model. 
Electron density associated with compound 11 is shown as a 2F,, — F, electron density difference map contoured to 1.50 (green mesh). Functional groups of 
compound 11 with their corresponding binding pockets are highlighted in yellow, green, and blue ellipses. Chemical structure of compound 11 is shown in the 
inset. E, interactions between MERS-CoV 3CL°"° and compound 11 are illustrated. Catalytic residues Cys-148 and His-41 are also shown. Hydrogen bonds are 
depicted as red dashed lines. 


The binding orientation for compound 11 in the active site of 
MERS-CoV 3CLP"® is similar to the binding orientation of 


NH of conserved Glu-169. The NH of the phenyl! propionami- 
dyl group interacts with backbone carbonyl oxygen of the cat- 


related compounds in the active site of SARS-CoV 3CL?*° (PDB 
code 4MDS). The benzotriazole group binds in the S, subsite; 
phenyl propionamidyl occupies the S’ ,-S, subsite, and the thio- 
phene group binds in the S, subsite. Compound 11 also forms 
two direct and one water-mediated hydrogen bond interactions 
with amino acids in the MERS-CoV 3CL?”® active site (Fig. 6£). 
The N3 of the benzotriazole ring forms a hydrogen bond with 
the side chain e-nitrogen of conserved His-166, and the central 
acetamide oxygen forms a hydrogen bond with the backbone 
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alytic His-41 residue through a water-mediated hydrogen 
bond, and the imidazole ring of His-41 engages with the phenyl 
ring of phenyl propionamidyl group through T-shaped 7 stack- 
ing. The phenyl ring also form hydrophobic contacts with 
Leu-49. 

Interactions at the 3CL”?’? Dimer Interface—Analysis of the 
MERS-CoV 3CLP" and 6 and MERS-CoV 3CL?*® and 11 crys- 
tal structures reveals key differences between the dimer inter- 
face of MERS-CoV and SARS-CoV 3CLP’° (PDB code 2ALV) 
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FIGURE 7. Comparison of x-ray crystal structures of 3CL?"° dimers from MERS-CoV, HKU4-CoV, and SARS-CoV. A, superposition of dimers of MERS-CoV 
3CL"° (pink color), HKU4-CoV 3CLP"° (yellow color, PDB code 2YNB), and SARS-CoV 3CL?"° (blue color, PDB code 2ALV). For SARS-CoV 3CL°"°, residues Arg-4 and 
Ser-123 from monomer A, and residues Gln-127, Lys-137, Glu-290, and Met-298 from monomer B are represented as spheres. B, for SARS-CoV 3CLP"°, interac- 
tions between the side chain of Arg-4 from monomer A and Gln-127, Glu-290, and Lys-137 residues from monomer B are shown. The corresponding residues 
in MERS-CoV 3CL?"° and HKU4-CoV 3CL”” are Val-4 in monomer A and Glu-290 in monomer B, which do not interact at the dimer interface. C, for SARS-CoV 
3CLP"°, Ser-123 from monomer A engages in hydrogen bonding with Arg-298 from monomer B across the dimer interface. The corresponding residue in 
monomer B of MERS-CoV 3CL°"° and HKU4-CoV 3CLP"? is Met-298, which does not participate in any interaction with Thr-126 from monomer A across the dimer 


interface. 


(Fig. 7) (30). Two arginine residues, Arg-4 and Arg-298 (Fig. 7, 
A-C), form some of the key interactions at the dimer interface 
of SARS-CoV 3CL?"°, and mutation of either of these amino 
acids results in a drastic loss of dimerization in SARS-CoV 
3CLP*° (36, 38). Interestingly, these two arginine residues 
(Arg-4 and Arg-298) are substituted in MERS-CoV 3CLP"° by 
two hydrophobic residues (Val-4 and Met-298) that are unable 
to participate in the formation of hydrogen bonds or salt 
bridges. Therefore, we initially thought that the loss of these key 
interactions might simply explain the >100,000-fold weaker 
dimerization observed for MERS-CoV 3CL?"° compared with 
SARS-CoV 3CL?”®. Surprisingly, however, structural analysis of 
the dimer interface from the available x-ray structure of HKU4- 
CoV 3CLP*° (PDB code 2YNB; Fig. 7, B and C), and primary 
sequence alignment of 3CLP’® from MERS-CoV, HKU5-CoV, 
HKU4-CoV and SARS-CoV (Fig. 8) revealed that Val-4 and 
Met-298 are conserved between all the B-CoV 2c members 
studied here. Substantial differences between the ability of 
MERS-CoV 3CLP° and HKU4/HKU5-CoV 3CL?° to 
dimerize, despite their high sequence identity, led us to the 
hypothesis that nonconserved residues between MERS-CoV 
and other B-CoV 2c members that are remote from the dimer 
interface may play a significant role in dimer formation. 
Analysis of Nonconserved Residues of MERS-CoV 3CL?’"°— 
Analysis of our current crystal structures does not reveal a 
clear mechanism for the monomer to dimer switch of MERS- 
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CoV 3CLP*° upon ligand binding. Therefore, we attempted 
to identify the nonconserved residues in MERS-CoV 3CLP*° 
that might affect enzymatic activity due to their proximity to 
key residues involved in substrate binding and/or dimer 
formation. 

Based ona sequence alignment, MERS-CoV 3CL?”° contains 
~24 nonconserved amino acids (pink arrows in Fig. 8). Upon 
analyzing the position of these amino acids in the crystal struc- 
ture, we observed that a remarkable number of these amino 
acids are present in the loop regions. Fig. 9A illustrates the 
nonconserved residues present in the loop regions as gray 
(monomer A) and pink (monomer B) spheres. Interestingly, we 
also observed that there are hot spots in the protein structure 
where most of these amino acids are clustered. These hot spots 
include the N-terminal region, the active site region, the inter- 
domain loop (loop between the catalytic fold and domain III), 
and the domain III. In MERS-CoV 3CLP*°, nonconserved 
amino acid His-8, which forms van der Waals contacts with 
Lys-155 of the same monomer and Thr-128 of the other mono- 
mer, is present at the end of the N-terminal finger (Fig. 9, Band 
C), whereas amino acids Asp-12 and Ala-15 are part of the 
N-terminal helix (Fig. 9B). Additionally, amino acids Thr-128, 
Lys-155, and Ser-158 are present within 6 A of the N-terminal 
region (Fig. 9B). Substitution to these amino acids in MERS- 
CoV 3CLP'° might have changed the protein dynamics in a way 
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FIGURE 8. Sequence alignment of 3CLP'° enzymes from MERS-CoV, HKU5-CoV, HKU4-CoV, and SARS-CoV. Programs MultAlin (64) and ESPript (65) were 
used for the sequence alignment and visualization. Secondary structural elements of MERS-CoV 3CLP'° are represented as spirals for a-helix, arrows for 
B-strands, 7 for 3,, helix, and T for B-turns. Residues Val-4 and Met-298 in MERS-CoV, HKU5-CoV, HKU4-CoV 3CLP", and Arg-4 and Arg-298 in SARS-CoV are 
shown ina green box; catalytic residues His-41 and Cys-148 are highlighted in a purple box. The nonconserved residues of MERS-CoV 3CL?"° are marked with pink 
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arrows. % identity with MERS-CoV 3CL?"° is shown. 
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FIGURE 9. Analysis of the nonconserved amino acids of MERS-CoV 3CLP"°. A, representation of MERS-CoV 3CL°"° dimer with monomers A and B colored in 
orange and yellow, respectively. Nonconserved residues that are present in the loop regions are shown as spheres in gray and pink for monomers A and B, 
respectively. Other nonconserved residues are represented as spheres with the corresponding chain color. Domains I-Ill and the inter-domain loop are labeled. 
Catalytic residues His-41 and Cys-148 are shown as green spheres. Inhibitor molecule is shown in both active sites in blue sticks. B-G, residues of monomer B are 
shown (yellow and pink), unless otherwise labeled. B, clustering of some of the nonconserved amino acids, His-8, Asp-12, Ala-15, Thr-128, Lys-155, and Ser-158, 
near the N-terminal region is shown. N-terminal helices for both monomers are labeled. C, His-8 from the N-terminal region forms van der Waals contacts with 
Lys-155 of the same monomer and Thr-128 of the other monomer in the dimer. D, nonconserved residue Met-61 forms hydrophobic contacts with the Met-43 
residue, which is in close proximity to the catalytic residue His-41.£, loop containing the nonconserved residue Ala-171 forms the S, pocket along with residues 
His-166 and His-175. F, Val-132 forms hydrophobic contacts with a residue within the same domain (Ala-114), as well as Glu-290 from domain Ill. G, noncon- 
served residue Tyr-137 makes hydrophobic contacts with Tyr-185; Tyr-185 along with two other nonconserved residues Thr-183 and Met-189 are present on 


the inter-domain loop. 


that only ligand binding populates the monomer conformation, 
which is more amenable to dimer formation. 

We also observe that some of the nonconserved residues in 
MERS-CoV 3CLP" are located in proximity to the substrate- 
binding site and might contribute toward ligand-induced 
dynamic changes favorable for dimer formation. For example, 
nonconserved amino acid Met-61 forms hydrophobic interac- 
tions with Met-43, which in turn is in close proximity to the 
catalytic residue His-41 (Fig. 9D). Residue Ala-171 is present on 
a loop, and this loop, along with conserved residues His-166 
and His-175, forms the S, subsite for binding the P, amino acid 
of the substrate (Fig. 9£). In addition to its influence on sub- 
strate binding, Ala-171 may also contribute toward dimer for- 
mation upon substrate binding due to its close proximity with 
Glu-169. This glutamate residue in SARS-CoV 3CLP*® (Glu- 
166) has been established as a key residue linking the substrate- 
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binding site to the dimer interface (56). Val-132 forms hydro- 
phobic interaction with other nonconserved residue Ala-114 
within domain II (Fig. 9F). Additionally, Val-132 is present 
within van der Waals contact distance of Glul-290 from extra- 
helical domain III (Fig. 9F). It is noteworthy that Glu-290 forms 
a salt bridge with Arg-4 across the dimer interface in SARS- 
CoV 3CL?"°. However, this interaction is not formed in MERS- 
CoV 3CLP'° due to the substitution of Arg-4 with Val-4. Tyr- 
137 forms hydrophobic contacts with the conserved residue 
Tyr-185 (Fig. 9G). 

Besides amino acid Val-132 that connects domains II and III, 
residue Tyr-185, along with two other nonconserved residues, 
Thr-183 and Met-189, is present on the inter-domain loop that 
connects the catalytic fold (domains I and II) with the extra-helical 
domain III (Fig. 9G). Flexibility within these residues might affect 
the orientation of domain III required for dimer formation. 
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FIGURE 10. Proposed model for polyprotein processing in MERS-CoV regulated by ligand-induced dimerization of MERS-CoV 3CLP"°. MERS-CoV 3CL""° 
domains | and II are together represented as the rectangular box, and domain Ill is represented as a cylinder. The N and C termini are labeled, and the yellow 
cylinder labeled S represents a ligand that can be a peptide inhibitor, peptide substrate, or 3CLP'° cleavage sites in the polyprotein. Various steps required for 
the auto-release of 3CL”’° from the polyprotein and subsequent processing of the polyprotein cleavage sites are described in the text. Suggested by our AUC 
and kinetic studies, the shaded region (steps 5 and 6) highlights the additional steps MERS-CoV 3CLP"° would undertake during polyprotein processing and have 


been described in the kinetic model depicted in Fig. 5B. 


Discussion 


Model for Regulation of the Enzymatic Activity of MERS-CoV 
3CL?”° during Polyprotein Processing—Enzymatic activity of 
coronavirus 3CLP’® is required for the processing of viral poly- 
proteins at 11 distinct cleavage sites, allowing the release of 
nonstructural proteins that subsequently form a replication 
complex for virus genome replication. Because of its indispens- 
able role in the virus life cycle, regulation of the enzymatic activ- 
ity of 3CLP’® is instrumental for efficient replication of corona- 
viruses. Based on our experimental results, we propose a model 
to explain the mechanism for regulating the enzymatic activity 
of MERS-CoV 3CLP"® in the context of polyprotein processing 
during virus infection (Fig. 10). 

A number of in vitro studies performed on SARS-CoV 
3CLP"° have established the mechanism for 3CLP*° auto-release 
from the polyprotein (34, 39, 40). Based upon these studies and 
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our data on MERS-CoV 3CL?"°, we propose the polyprotein 
processing model in Fig. 10. The steps proposed for auto-re- 
lease of MERS-CoV 3CLP"® from the polyprotein (steps 1-4, 
Fig. 10) have been adapted from Chen et al. (39), where it is 
suggested that the N-terminal auto-processing does not require 
the formation of a mature 3CL?’° dimer for SARS-CoV. Based 
on the differences between the properties of SARS-CoV 3CLP"® 
and MERS-CoV 3CL?”®, as highlighted in our studies, we added 
two additional steps (steps 5 and 6, Fig. 10) that MERS-CoV 
3CLP*° may need to utilize for efficient polyprotein processing. 
In Fig. 10, step 1, two immature MERS-CoV 3CL?"° monomers 
in the polyprotein approach each other and form an immature 
dimer via interactions between domain III, which allows each of 
the monomers to insert their N termini into the active site of the 
other monomer. In step 2, the N termini are cleaved, and the 
dimer with uncleaved C termini adopts a conformation similar 
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to the mature dimer. Our observation of auto-cleavage of the 
N-terminal His, tag from MERS-CoV 3CL?*° during expres- 
sion in bacterial cells supports steps J and 2, where formation of 
an immature dimer capable of auto-processing the N terminus 
occurs. In step 3, two dimers with uncleaved C termini 
approach each other, followed by insertion of the C terminus 
from one dimer into one of the active sites of the other dimer. In 
step 4, the C termini are cleaved and mature dimer is released 
from the polyprotein. 

For SARS-CoV, the 3CLP"® dimer formed in step 4 continues 
to process cleavage sites in the polyprotein, effectively skipping 
steps 5 and 6 (red arrow in Fig. 10) because the dimer is tightly 
associated. However, the high K, value of MERS-CoV 3CLP’° 
dimer suggests that the active and mature dimer may dissociate 
into inactive, mature monomers in the absence of any ligand 
(step 5). In order for polyprotein processing to proceed, another 
step (step 6) must occur. In step 6, a substrate S, e.g. one of the 11 
polyprotein cleavage sites, would induce dimer formation and 
hence activate catalysis and cleavage at the substrate recogni- 
tion sites. Our AUC results and the kinetic activation studies 
performed in the absence and presence of inhibitors support 
steps 5 and 6 where the inactive but mature monomers require 
binding ofa ligand to undergo ligand-induced dimerization and 
formation of an active, mature dimer that can then process the 
polyprotein cleavage sites. 

Nonconserved Amino Acids of MERS-CoV 3CL?”? May Regu- 
late the Dimer Formation—Long range interactions have been 
reported to modulate dimerization and activity of 3CLP*° 
enzymes. Barrila et al. (57) demonstrated that mutation of a 
conserved amino acid Ser-147, which is distant from the dimer 
interface, results in a total loss of dimerization and enzymatic 
activity of SARS-CoV 3CLP"°. Although Ser-147 does not form 
direct interactions at the dimer interface, disruption of the 
dimer upon mutation stems from the fact that Ser-147 makes 
several interactions with other residues involved in forming a 
hydrogen bonding network within SARS-CoV 3CLP"™. Site-di- 
rected mutagenesis studies on domain III of SARS-CoV 3CL?”®, 
where N214A and $284A/T285A/1286A mutants were charac- 
terized, revealed that despite being present on an entirely dif- 
ferent domain, these residues affect catalysis through a network 
of residues undergoing correlated motions across the entire 
protease (58, 59). Utilizing 3CL?*° temperature-sensitive 
mutants of MHV, Stobart et al. (60) have also demonstrated 
that second-site mutation physically distant from the tempera- 
ture-sensitive mutation suppresses the temperature-sensitive 
phenotype through long range interactions, thereby regulating 
3CLP° enzymatic activity during polyprotein processing and 
virus replication. 

Our studies also suggest that long range interactions among 
the nonconserved residues can significantly alter the properties 
of MERS-CoV 3CL?"®. A detailed analysis of nonconserved res- 
idues of MERS-CoV 3CLP*° among B-CoV 2c members identi- 
fied hot spots, including the N-terminal finger and helix, the 
active site region, the inter-domain loop, and the domain III, 
where these residues are clustered. Several studies done on 
SARS-CoV 3CLP*° have demonstrated that amino acids from 
the N-terminal finger, the N-terminal helix, and domain III 
significantly contribute toward dimer formation. 
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In addition to the direct interactions at the dimer interface, 
correct orientation between the catalytic fold and domain III is 
also crucial for dimer formation. Wu et al. (61) showed that the 
mostdramaticdifferencebetweenthecrystalstructuresofmono- 
mer and the ligand-bound dimer of the R298A mutant of SARS- 
CoV 3CL?"° was a 33° rotation of domain III (38). This rotation 
results ina steric clash between domain III from two monomers 
and would essentially block dimer formation. However, upon 
addition ofa ligand, domain III of the R298A mutant adopts the 
correct orientation and results in the formation of a dimer 
structure. Similar to the SARS-CoV 3CLP*° R298A mutant, 
ligand binding into the active site of the MERS-CoV 3CLP"° 
monomer possibly stabilizes the inter-domain loop conforma- 
tion that maintains domain III in the correct orientation for 
dimer formation. Most of the nonconserved residues within 
domain III are present on the surface and also are distant from 
the dimer interface. These residues may be involved in provid- 
ing the flexibility required for conformational changes during 
the monomer to dimer switch. 

We have identified several amino acids in MERS-CoV 3CLP”° 
that may contribute to the dimer formation upon ligand bind- 
ing. However, single amino acid mutagenesis alone is unlikely 
to reveal significant differences in the dimerization properties. 
As demonstrated by Myers et al. (62) for ornithine decarboxy- 
lase, the response of single amino acid to ligand binding may be 
limited to only local conformational changes and may not have 
significant contribution toward dimer stability. However, local 
conformational changes in a network of residues may propa- 
gate larger effects that stabilize dimer formation upon ligand 
binding. Analysis of the nonconserved residues of MERS-CoV 
3CLP*° discussed here sets forth a framework to perform sys- 
tematic single or multiple mutagenesis studies to gain insights 
into the mechanism for ligand-induced dimerization of the 
enzyme. 

Development of 3CL? Inhibitors with Broad Spectrum 
Specificity—Insights into the mechanistic and structural simi- 
larities as well as differences between 3CL?"® enzymes from 
different coronavirus subgroups are instrumental for the devel- 
opment of 3CL?”° inhibitors with broad spectrum specificity. 
To evaluate the broad spectrum specificity of our peptidomi- 
metic compounds, we determined their inhibitory activity 
against 3CLP’° from MERS-CoV, SARS-CoV, HKU5-CoV, and 
HKU4-CoV. Our inhibitory data and K; values clearly show that 
compounds 6-9 inhibit all the 3CLP’° enzymes tested here. 
The x-ray structure of MERS-CoV 3CLP° in complex with 
compound 6 revealed that out of eight direct hydrogen bonds 
formed between compound 6 and MERS-CoV 3CL?"®, four of 
these hydrogen bonds involve interactions with conserved 
structural elements of the peptide backbone of the enzyme. 
Furthermore, the amino acids that form hydrogen bonds with 
compound 6 through side chain interactions are conserved in 
all the coronavirus 3CL?*° enzymes evaluated here, as well as 
3CLP*° enzymes from other B-coronaviruses like MHV, OC43, 
and HKU1. These results suggest that canonical structural fea- 
tures exist among the 3CL?*° enzymes that can be exploited for 
structure-based design of broad spectrum inhibitors. 

For the noncovalent inhibitor compound 11, the x-ray struc- 
ture reveals two direct hydrogen bonding interactions between 
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the compound and MERS-CoV 3CL?"®. One of the hydrogen 
bonds forms with the side chain e-nitrogen of conserved His- 
166, and the second involves the backbone NH of conserved 
Glu-169. We speculate these interactions remain conserved in 
other 3CLP’° enzymes as well, because His-166 and Glu-169 
amino acids are conserved in all 3CLP*° enzymes. In fact, the 
crystal structure of SARS-CoV 3CL?*° in complex with an 
inhibitor similar to compound 11 (PDB code 4MDS) reveals 
that the interactions of the inhibitor with the amino acids His- 
166 and Glu-169 are conserved. 

The identification of 3CL?*°-inhibitor interactions utilizing 
conserved elements of the protein structure, including the pep- 
tide backbone and conserved side chains of active site residues, 
suggests that the development of broad-spectrum inhibitors of 
coronavirus 3CL?”° is feasible. 

Our studies here demonstrate the unique properties of 
MERS-CoV 3CLP"® among B-CoV 2c members, evident from 
the requirement for a ligand to induce dimerization. Although 
the peptidomimetic compounds containing a Michael acceptor 
group (for example, compounds 6-9) induce dimer formation 
of MERS-CoV 3CL?"®, the irreversible nature of their reaction 
with the active site cysteine ensures complete inhibition of the 
enzyme at stoichiometric ratios in a time-dependent manner. 
On the contrary, noncovalent peptidomimetic compounds (for 
example, compounds 10 and 11) inhibit the enzymatic activity 
of MERS-CoV 3CL?”® only at high compound concentrations. 
Based on these observations, compounds that irreversibly mod- 
ify the 3CLP"® active site may serve as better candidates for the 
development of inhibitors for MERS-CoV 3CL?"®. 

Potential Complexity in the Development of MERS-CoV 
3CL?"? Inhibitors as Antiviral Agents—Induced dimerization of 
MERS-CoV 3CLP", as seen in the presence of peptidomimetic 
inhibitors, has significant implications in the development of 
antiviral agents targeting MERS-CoV 3CL?"®. As a consequence 
of enzyme activation, the development of an effective antiviral 
agent may necessitate the development of a compound that can 
inhibit the MERS-CoV 3CL?"° monomer and stabilize it with- 
out inducing dimerization and/or inhibit the active sites of the 
dimer at low doses, ensuring inactivation of both the active sites 
within the dimer. On the contrary, it is also possible that the 
presence of an inhibitor could enhance the activity of MERS- 
CoV 3CLP’° to an extent that results in a complete loss of the 
temporal and spatial regulation of the enzymatic activity, 
thereby disrupting viral genome replication. Ramifications of 
ligand-induced dimerization and activation of MERS-CoV 
3CLP*®, as seen in the presence of lower concentrations of 
inhibitor, will need to be further explored in virus-infected 
cells. 
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